Threshold-based declustering

نویسنده

  • Ali Saman Tosun
چکیده

Declustering techniques reduce query response time through parallel I/O by distributing data among multiple devices. Except for a few cases it is not possible to find declustering schemes that are optimal for all spatial range queries. As a result of this, most of the research on declustering has focused on finding schemes with low worst case additive error. However, additive error based schemes have many limitations including lack of progressive guarantees and existence of small non-optimal queries. In this paper, we take a different approach and propose threshold-based declustering. We investigate the threshold such that all spatial range queries with buckets are optimal. Upper bound on threshold is analyzed using bound diagrams and a number theoretic algorithm is proposed to find schemes with high threshold value. Threshold-based schemes has many advantages: they have low worst-case additive error, provide progressive guarantees by dividing larger queries into subqueries with buckets, can be used to compare replicated declustering schemes and render many large complementary queries optimal.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Iterative-improvement-based declustering heuristics for multi-disk databases

Data declustering is an important issue for reducing query response times in multi-disk database systems. In this paper, we propose a declustering method that utilizes the available information on query distribution, data distribution, data-item sizes, and disk capacity constraints. The proposed method exploits the natural correspondence between a data set with a given query distribution and a ...

متن کامل

Efficient Data-Sensitive Techniques for Parallel Retrieval of Keyword-indexed Information

Keyword based search of data, such as documents, maps, images, audio and video data, is an everyday activity for many millions of people with myriad uses, e.g., scientific computing, digital libraries, the web, catalogs, geographical information systems, music servers, etc. In this paper, we present several declustering algorithms based on existing similarity measures as well as their generaliz...

متن کامل

Declustering Spatial Objects by Clustering for Parallel Disks

In this paper, we propose an eÆcient declustering algorithm which is adaptable in di erent data distribution. Previous declustering algorithms have a potential drawback by assuming data distribution is uniform. However, our method shows a good declustering performance for spatial data regardless of data distribution by taking it into consideration. First, we apply a spatial clustering algorithm...

متن کامل

cient Disk Allocation for Fast Similarity Searching

As databases increasingly integrate non-textual information it is becoming necessary to support eecient similarity searching in addition to range searching. Recently, declustering techniques have been proposed for improving the performance of similarity searches through parallel I/O. In this paper, we propose a new scheme which provides good declus-tering for similarity searching. In particular...

متن کامل

Declustering Objects for Visualization

In this paper we propose a new declustering method which is particularly suitable for image and cartographic databases used for visualization. Our declustering method is based on algebraic techniques using vectors. The algorithm which computes the disk assignment requires O(Kj log K) time where K is the number of parallel disks in the system. The resulting disk assignment maximizes the area tha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Sci.

دوره 177  شماره 

صفحات  -

تاریخ انتشار 2007